Detecting Misleading Information on COVID-19

IEEE Access. 2020 Sep 9:8:165201-165215. doi: 10.1109/ACCESS.2020.3022867. eCollection 2020.

Abstract

This article addresses the problem of detecting misleading information related to COVID-19. We propose a misleading-information detection model that relies on the World Health Organization, UNICEF, and the United Nations as sources of information, as well as epidemiological material collected from a range of fact-checking websites. Obtaining data from reliable sources should assure their validity. We use this collected ground-truth data to build a detection system that uses machine learning to identify misleading information. Ten machine learning algorithms, with seven feature extraction techniques, are used to construct a voting ensemble machine learning classifier. We perform 5-fold cross-validation to check the validity of the collected data and report the evaluation of twelve performance metrics. The evaluation results indicate the quality and validity of the collected ground-truth data and their effectiveness in constructing models to detect misleading information.

Keywords: COVID-19; Coronavirus; SARS-CoV-2; WHO; fake news detection; infodemic; misleading information; pandemic; social media; social networks; text classification; text mining; web mining.